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ABSTRACT 

In order to interpret H 2 quasar absorption line observations of damped Lya systems 
(DLAs) and sub-DLAs, we model their H2 abundance as a function of dust-to-gas 
ratio, including H2 self-shielding and dust extinction against dissociating photons. 
Then, we constrain the physical state of gas by using H2 data. Using H2 excitation 
data for DLA with H2 detections, we derive a gas density 1.5 < log ri [cm -3 ] < 2.5, 
temperature 1.5 < logT [K] < 3, and internal UV radiation held (in units of the 
Galactic value) 0.5 < logx < 1.5. We then find that the observed relation between 
molecular fraction and dust-to-gas ratio of the sample is naturally explained by the 
above conditions. However, it is still possible that H2 deficient DLAs and sub-DLAs 
with H2 fractions less than ~ 10~ 6 are in a more diffuse and warmer state. The efficient 
photodissociation by the internal UV radiation field explains the extremely small H2 
fraction (< 10~ 6 ) observed for k < 1/30 (k is the dust-to-gas ratio in units of the 
Galactic value); H2 self-shielding causes a rapid increase and the large variations of 
H2 abundance for k > 1/30. We finally propose an independent method to estimate 
the star formation rates of DLAs from H2 abundances; such rates are then critically 
compared with those derived from other proposed methods. The implications for the 
contribution of DLAs to the cosmic star formation history are briefly discussed. 
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1 INTRODUCTION 

Damped Lya clouds (DLAs) are quasar (QSO) absorption 
line systems whose neutral hydrogen column density is larger 
than ~ 1 - 2 x 10 20 cm" 2 (e.g. Prochaska & Wolfe 2002). 
DLAs absorb Lya photons at the restframe of the DLAs. Be- 
cause QSOs are generally luminous, DLAs provide us with 
unique opportunities to trace high-redshift (high-2) galaxy 
evolution. In particular, by identifying absorption lines of 
various species, the physical condition of the interstellar 
medium (ISM) of DLAs has been deduced. 

From the analyses of various absorption lines, evidence 
has been found for the existence of heavy elements in DLAs 
(e.g. Lu et al. 1996). The evolution of metal abundance in 
DLAs can trace the chemical enrichment history of present 
galaxies. Based on this, and on other clues, DLAs have been 
suggested to be the progenitors of nearby galaxies; the sim- 
ilar values of the baryonic mass density in DLAs around 
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redshift 2 ~ 2 and the stellar mass density at z ~ has 
further supported this idea (Lanzetta, Wolfe, & Turnshek 
1995; Storrie-Lombardi & Wolfe 2000). By adopting a re- 
cently favoured ACDM cosmology, however, Peroux et al. 
(2003) argue that the comoving density of H I gas at z ~ 2 
is smaller than the comoving stellar mass density at z ~ 0. 
Yet they strengthen the importance of DLAs, showing that 
a large fraction of H I gas is contained in DLAs at z ~ 2-3. 

In general, a certain fraction of metals condenses onto 
dust grains. Indeed, Fall, Pei, & McMahon (1989) have sug- 
gested that the reddening of background quasars indicates 
typical dust-to-gas ratios of ~ 1/20-1/4 of the Milky Way 
(see also Zuo et al. 1997, but see Murphy & Liske 2004). The 
depletion of heavy elements also supports the dust content 
in DLAs (Pettini et al. 1994; Vladilo 2002). The existence 
of dust implies the possibility that the formation of hydro- 
gen molecules (H2) is enhanced because of the H2 grain sur- 
face reaction (Lanzetta, Wolfe, & Turnshek 1989). Hirashita 
& Ferrara (2002) argue that the enhancement of molecular 
abundance results in an enhancement of the star formation 
activity in the early evolutionary stages of galaxy evolution, 
because stars form in molecular clouds. The important role 
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of dust on the enhancement of the H2 abundance is also sug- 
gested by observations of DLAs (Ge, Bechtold, & Kulkarni 
2001; Ledoux, Petitjean, & Srianand 2003) and in the lo- 
cal Universe, e.g. in Galactic (Milky Way) halo clouds (e.g. 
Richter et al. 2003) and in the Magellanic Clouds (Richter 
2000; Tumlinson et al. 2002). 

Although the H2 fraction (the fraction of hydrogen nu- 
clei in the form of H2; see equation 1111 is largely enhanced 
for some DLAs, stringent upper limits are laid on a sig- 
nificant fraction of DLAs in the range < 10- 7 -10 -5 (Black, 
Chaffee, & Foltz 1987; Petitjean, Srianand, & Ledoux 2000). 
This can be interpreted as due to a low formation rate of H2 
in dust-poor environments relative to the Milky Way (Lev- 
shakov et al. 2002; Liszt 2002) and high H2 dissociation rate 
by strong ultraviolet (UV) radiation (e.g. Petitjean et al. 
2000). However, we should keep in mind that such upper 
limits do not exclude the existence of molecule-rich clouds 
in these systems, because molecular clouds may have a very 
low volume filling factor. Indeed, based on the hydrodynam- 
ical simulation of Wada & Norman (2001), Hirashita et al. 
(2003) show that under a strong UV field typical of high-z 
and a poor dust content (~ 1/10 of the Galactic dust-to-gas 
ratio), H2-rich regions are located in very clumpy small re- 
gions. In such a situation, it is natural that molecular clouds 
are hardly detected in DLAs. 

The probability of detecting H2 is higher for DLAs 
with larger dust-to-gas ratio or larger metallicity. Indeed, 
H2 tends to be detected for metal-rich DLAs (Ledoux et 
al. 2003). The correlation between dust-to-gas ratio and H2 
abundance for DLAs indicates that H2 predominantly forms 
on dust grains as in the Galaxy (e.g. Jura 1974). Since the 
H2 formation and destruction rates are affected mainly by 
gas density, dust-to-gas ratio, and UV radiation intensity, we 
can derive or constrain those quantities for DLAs based on 
H2 abundance. Those quantities also enable us to draw con- 
clusions about other important quantities such as cooling 
and heating rates, star formation rate (SFR), etc. (Wolfe, 
Prochaska, & Gawiser 2003a). 

Our main aim in this paper is to investigate what we 
can learn from the recent H2 observations of DLAs and sub- 
DLAs. (Although we mainly focus on DLAs, we also include 
low column density systems, which are called sub-DLAs.) 
In particular, we focus on key quantities for H2 formation 
and destruction (i.e. dust-to-gas ratio and UV intensity, re- 
spectively). The physical state of H2 reflects the physical 
state of gas, especially, gas density and temperature. In this 
work, we concentrate on H2 data to derive those physical 
quantities. Recent FUSE (Far Ultraviolet Spectroscopic Ex- 
plorer) observations of the Galactic ISM (e.g. Gry et al. 
2002; Marggraf, Bluhm, & de Boer 2004), the Galactic halo 
clouds (e.g. Richter et al. 2003), and the Magellanic Clouds 
(e.g. Tumlinson et al. 2002; Richter, Sembach, & Howk 2003; 
Andre et al. 2004) are successful in deriving the physical 
state of gas from H2 absorption line data. We focus on DLAs 
and sub-DLAs to investigate the high-z universe. Since re- 
cent observations suggest that the local UV radiation orig- 
inating from star formation within DLAs is stronger than 
the UV background intensity (Ledoux, Srianand, & Petit- 
jean 2002; Wolfe et al. 2003a), we include the local UV field 
in this work. We call the local UV field "interstellar radiation 
field (ISRF)". 

Since it is still unclear whether DLAs are large proto- 



galactic discs (Wolfe et al. 1986; Prochaska & Wolfe 1998; 
Salucci & Persic 1999), small sub-L* galaxies (Gardner et 
al. 2001; M0ller et al. 2002; Okoshi et al. 2004), protogalac- 
tic clumps (Haehnelt, Steinmetz, & Rauch 1998; Ledoux et 
al. 1998), or a mixture of various populations (Burbidge et 
al. 1996; Cen et al. 2003; Rao et al. 2003), we adopt a sim- 
ple model which nevertheless includes all relevant physical 
processes and derive robust general conclusions. 

We first describe the model we use to derive the molec- 
ular content in DLAs (Section After describing the ob- 
servational sample adopted in this paper (Section [3J , we 
compare our results with the data and constrain the physi- 
cal conditions of DLAs (Section 0J. Based on these results, 
we extend our discussion to the SFR (Section |KJ, and we 
finally give a summary of this paper (Section |SJ. 



2 MODEL 

Our aim is to investigate the physical conditions in the ISM 
of DLAs by treating H2 formation and destruction for a 
statistical sample. For the homogeneity of our analysis, we 
concentrate our interest on H2. Our aim is not to analyse 
the data of each object in detail by using various absorption 
lines of various species, since different lines may originate 
from different places and detected lines are different from 
object to object. Our model is analytical for the simple ap- 
plication to a large statistical sample. There are a lot of 
works that treat details of various gas state by fitting the 
observational results of absorption lines of various atoms and 
molecules. Therefore, our model may be too simple to derive 
a precise physical quantities for each object. However, our 
analysis has the following advantages for the statistical pur- 
pose: (i) A large sample is treated homogeneously, since we 
concentrate only on H2 in an analytic way. (ii) The results 
directly conclude the statistical properties of DLAs and are 
not affected by details and peculiar situations of each object. 

The relevant physical quantities are those concerning 
the H2 formation and destruction, that is, molecular frac- 
tion, dust-to-gas ratio, UV radiation field, and gas density 
and temperature. Observationally, the molecular fraction, 
the dust-to-gas ratio and the H I column density are rela- 
tively well known, but the UV radiation field, and the gas 
density and temperature are poorly constrained. Thus, we 
first constrain the reliable ranges in those quantities by re- 
viewing H2 detected objects. Then, the likelihood of those 
parameters are discussed by using a statistical sample. 



2.1 H2 formation and destruction 

For the metallicity range typical of DLAs, we can assume 
equilibrium between H2 formation and destruction, because 
the timescale of H2 formation and destruction is well below 
the dynamical timescale (Hirashita et al. 2003). We adopt 
the formation rate of H2 per unit volume and time, -Rdust, 
by Hollenbach & McKee (1979) (see also Hirashita et al. 
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2002) 1 : 

R dust = 4.1 x 10~ 17 5 d (T) 
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where a is the radius of a grain (assumed to be spherical 
with a radius of 0.1 /im unless otherwise stated), T> is the 
dust-to-gas mass ratio (varied in this paper; the typical value 
in the solar vicinity of the Milky Way is 10 -2 ), 8 is the grain 
material density (assumed to be 2 g cm -3 in this paper), and 
Sd {T) is the sticking coefficient of hydrogen atoms onto dust. 
The sticking coefficient is given by (Hollenbach & McKee 
1979; Omukai 2000) 

S d (T) = [l + 0.04(T + T d ) a5 + 2 x 10~ 3 T + 8 x 10 

x [l + exp(7.5 x 10 2 (l/75- 1/Td))]" 1 , (2) 

where Td is the dust temperature, which is calculated by as- 
suming the radiative equilibrium in equation (1121 . However, 
since the reaction rate is insensitive to the dust temperature 
as long as Td < 70 K, the following results are not affected by 
the dust temperature. In fact, Td never exceeds 70 K under 
the radiation field intensity derived in this paper. The H2 
formation rate per unit volume is estimated by -Rdust nnn, 
where n is the gas number density, and njj. is the number 
density of H I. 

The H2 formation rate on dust grains is still to be de- 
bated. It depends on grain size (eq. 0. If the grain size is 
much smaller than 0.1 /im, the H2 formation rate is largely 
enhanced. There are uncertainties also in Sd(T), which could 
depend on the materials of dust. Cazaux & Tielens (2002) 
also suggest that Sd(T) in equation Q should be substituted 
by eH 2 Sd(T), where eh 2 is the recombination efficiency (the 
fraction of the accreted hydrogen that desorbs as H2). The 
recombination efficiency is en 2 ~lif5<T<30K, and en 2 
becomes ~ 0.2 at T ~ 100 K. If the temperature is higher 
than ~ 300 K, 6h 2 ~ 0. However, if en 2 ~ 0.2 is multiplied 
to the H2 formation rate at T = 100 K, the grain formation 
rate becomes significantly small, and the Galactic H2 forma- 
tion rate derived for T ~ 100 K by Jura (1974) cannot be 
achieved; we have to keep in mind that the theoretical deter- 
mination of i?dust is affected by some uncertainty. Thus, in 
this paper, we adopt equation Q to assure that it provides 
the Galactic H2 formation rate if we adopt a ~ 0.1 /xm, 
V ~ 0.01, T ~ 100 K, and 5 ~ 2 g cm" 3 . Since it is im- 
portant to recongnise that the temperature dependence of 
-Rdust is uncertain, we do not deeply enter discussions on the 
temperature dependence in this paper. 

The photodissociation rate Rdiss is estimated as (Abel 
et al. 1997) 



Rdii 



(4tt)U x 10 8 J L wSs hic id s" 1 



(3) 



where Jlw (erg s _1 cm -2 Hz" 1 sr" 1 ) is the UV intensity at 
hi; = 12.87 eV averaged over the solid angle, Sshicid is the 
correction factor of the reaction rate for H2 self-shielding 



1 In Hirashita et al. (2003), there is a typographic error. (The 
results are correctly calculated.) The expression in this paper is 
correct and Ri in Hirashita et al. (2003) is equal to i?dust nrl H- 



and dust extinction. The photodissociation rate per unit vol- 
ume is estimated as R!diss?iH 2 > where nu 2 is the number den- 
sity of H2. We adopt the correction for the H2 self-shielding 
by Draine & Bertoldi (1996) 2 (see also Hirashita & Ferrara 
2002). Then, we estimate Shield as 



Sshield = mm 



( n H2 y 

VlO 14 cm" 2 / 



-CT d iV d 



(4) 



where Nn 2 and Nd are the column densities of H2 and dust, 
respectively, and ad is the cross section of a grain against H2 
dissociating photons. In the UV band, <Td is approximately 
estimated with the geometrical cross section: ad — tvo 2 
(Draine & Lee 1984). The column density of grains is re- 
lated to Nh: (4/3)Tva 3 SNd = lArrmNuD (the factor 1.4 is 
the correction for the helium content). Therefore, the optical 
depth of dust in UV, txjv = &dNd, is expressed as 
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4.2Af H m H © 
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Since V < 10" 2 for DLAs (often V < 10" 2 ), the dust ex- 
tinction is negligible except for the high column density and 
dust-rich regime satisfying NnT> > 10 19 cm -2 . We also de- 
fine the critical molecular fraction, /h 2 as the molecular frac- 
tion which yields A r H 2 = 10 14 cm" 2 : 



/S 2 (iY H ) = 2xl0 14 /A r H. 



(6) 



For /h 2 > /h 2 , the self-shielding affects the H2 formation. 

In this paper, we assume the Galactic (Milky Way) 
dust-to-gas ratio to be T>q — 0.01. We define the normalised 
dust-to-gas ratio k as 



(7) 



The typical Galactic ISRF intensity has been estimated 
to be cvu v = 1.2 x 10 -3 erg cm -2 s _1 at the wavelength of 
1000 A ( i.e. v = 3.0 x 10 15 Hz), where it„ is the radiation 
energy density per unit frequency (Habing 1968). Approxi- 
mating the energy density of photons at 1000 A with that 
at the Lyman- Werner Band, we obtain for Jlw at the solar 
vicinity, Jlw© — cu v /4n = 3.2 x 10~ 20 erg s _1 cm" 2 Hz" 1 
sr" 1 . The intensity normalised to the Galactic ISRF, x, is 
defined by 

X = Jlw/Jlwq ■ (8) 
Using equations and JHJ, we obtain 
R di ss = 4.4 x 10" n xS shicld s" 1 . (9) 
With x = 1> the formula reproduces the typical Galactic 

2 The exact treatment of self-shielding slightly deviates from 
the fitting formula that we adopt (see Figures 3-5 of Draine & 
Bertoldi 1996; the difference is within a factor of ~ 2). Accord- 
ingly, the estimate of radiation field in Section l3.2l is affected by 
the same amount. The difference in the shielding factor may cause 
a large difference in theoretically calculated molecular fraction 
(Abel et al. 2004). This point is commented in the last paragraph 
of Section IO 
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photodissociation rate (2-5 x 10 11 s 1 ) derived by Jura 
(1974). 

The UV background intensity at the Lyman limit is es- 
timated to be J21 = 0.3-1 around z ~ 3, where J21 is in 
units of 10 -21 erg cm -2 s _1 Hz -1 sr _1 (Haardt & Madau 
1996; Giallongo et al. 1996; Cooke, Espey, & Carswell 1997; 
Scott et al. 2000; Bianchi, Cristiani, & Kim 2001). If we as- 
sume that the Lyman limit intensity is equal to the Lyman- 
Werncr luminosity, we roughly obtain \ = 32J21- The UV 
background intensity is likely to be lower at z < 1 (e.g. Scott 
et al. 2002). Thus, the internal stellar radiation is dominant 
for H2 dissociation if X > 1/32. 

As mentioned at the beginning of this subsection, we 
can assume that the formation and destruction of H2 are in 
equilibrium. Therefore, the following equation holds: 



RdustnriH = Rdis S riH 2 



(10) 



In order to solve this equation, it is necessary to give the gas 
temperature T, the gas number density n, the normalised 
dust-to-gas ratio re, the normalised ISRF x> an d the column 
density Ah- As for the dust grains, we assume a — 0.1 /im, 
and 5 = 2 g cm -3 , all of which are typical values in the local 
universe. After solving that equation, the molecular fraction 
is obtained from the definition: 

2n H , 



/h 2 - ,, 

riH + 2riH 2 

where we neglect the ionised hydrogen for DLAs. 



(11) 



Section 12.11 the self-shielding is more important than the 
dust extinction. Thus, we neglect the dust extinction in this 
subsection. 

Under the condition that nn 2 <S «h, /h 2 — 2nn 2 / n H <§C 
1, and by using the expressions above for Rdust and Rdiss, we 
obtain the following scaling relations for /h 2 from equation 
1101 : in the unshielded regime (fH 2 Na < 10 14 cm -2 ), 

/H.KKnx-'T^Sar), (13) 

and in the shielded regime (/h 2 Nh > 10 14 cm -2 ), 

/h 2 « [Kn X -1 T 1/2 S d (T)] 4 7V I i . (14) 

We have adopted the approximate formula Shield oc 
iV" 2 - 75 for Ah 2 > 10 14 cm" 2 . In fact, there is a slight (fac- 
tor of ~ 2) difference between this scaling and the exact 
calculation (Figs. 3-5 of Draine & Bertoldi 1996). In the 
self-shielding regime, this slight difference may cause an or- 
der of magnitude difference in the calculated /h 2 because of 
a nonlinear dependence on 7Vh 2 (see the appendix of Abel 
et al. 2004). In practice, this difference can also be exam- 
ined by changing Rdust in stead of Rdiss, and indeed, if we 
change -Rdust by a factor of ~ 3, the calculated molecular 
fraction is significantly affected as shown in Section 0] For 
the statistical purpose, however, our conclusions are robust, 
since Nn 2 spans over a wide range in our sample and the 
fitting formula approximate the overall trend of Sshicid as a 
function of Nu 2 very well. 



2.2 Dust temperature 

Dust temperature is necessary to calculate the sticking effi- 
ciency of H onto grains (equation |5J . The equilibrium dust 
temperature is determined from the balance between inci- 
dent radiative energy and radiative cooling. We adopt the 
equilibrium temperature calculated by Hirashita & Hunt 
(2004) (see also Takeuchi et al. 2003): 

Td = 12 ^ 1/S ( 3 .2xlta cm )' 1/6 

/ \-V6 

x — — K, (12) 

where the constant A depends on the optical properties of 
dust grains, and for silicate grains A — 1.34 x 10 -3 cm 
(Drapatz & Michel 1977), while for carbonaceous grains 
A = 3.20 x 10~ 3 cm (Draine & Lee 1984; Takeuchi et al. 
2003). We hereafter assume Quv = 1, A = 3.20 x 10~ 3 cm, 
and a = 0.1 /jm; these assumptions affect very weakly our 
calculations due to the 1 /6 power index dependence of those 
parameters. 

2.3 Approximate scaling of molecular fraction 

As a summary of our formulation, we derive an approximate 
scaling relation for /h 2 . The molecular fraction is deter- 
mined from equation 1101 . The left-hand side describes the 
formation rate, which follows the scaling relation, -Rdust oc 
reT 1//2 Sd(T). On the other hand, the H2 destruction coef- 
ficient J?diss is proportional to \ f° r /h 2 Ah < 10 14 cm -2 
{unshielded regime), and is proportional to x(/h 2 -^Vh) -0 ' 75 
for fn 2 N H > 10 14 cm -2 (shielded regime). As mentioned in 



3 DATA SAMPLE 
3.1 Overview 

Recently, Ledoux et al. (2003) have compiled a sample of 
Lya clouds with H2 observations. The H I column density 
of their sample ranges from logiV(H 1) = 19.35 to 21.70. 
They have found no correlation between logiV(H 1) and 
log /h 2 , but have found clear correlation between the metal 
depletion (or dust-to-gas ratio, log re) and log /h 2 . Although 
some clouds have H I column densities smaller than the 
threshold for DLAs (typically log JV(H 1) > 20.3), we treat 
all the sample, because there is no physical reason for us- 
ing this threshold for H I. Such low-column density objects 
called sub-DLAs can be useful to investigate a wider range 
of iV(H 1) and get an an insight into several physical pro- 
cesses, especially self-shielding. For the absorption system 
of Q 0013—004 (the absorption redshift is z a b s = 1.973), 
we adopt the mean of the two values, —1.73, for log/H 2 
(but in the figures, the observationally permitted range, 
—2.81 < log/H 2 < —0.64, is shown by a bar). Since it is 
likely that almost all the hydrogen nuclei are in the form of 
H 1, we use iV(H 1) and A^h interchangeably. (If the ionised 
hydrogen is not negligible in a system, we can interpret our 
result to be representative of the neutral component in the 
system.) 

Ledoux et al. (2003) observationally estimate the dust- 
to-gas ratio re from the metal depletion: 

re=10 [x/H1 (l-10 [Fo/xl ), (15) 

where X represents a reference element that is little affected 
by dust depletion effects. We adopt this formula in this pa- 
per. A formal derivation of equation 115H is given by Wolfe 
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Figure 1. Relation between /h 2 (the molecular fraction of 
hydrogen) and re (dust-to-gas ratio normalised by the Galac- 
tic value). The squares are the observational data compiled by 
Ledoux et al. (2003). The data with upward and downward arrows 
indicate that only lower and upper limits for /h 2 are obtained, 
respectively, and those with rightward and leftward arrows show 
lower and upper limits for re, respectively. The cross is the data of 
Reimers et al. (2003). The thick solid, dotted, dashed, dot-dashed 
lines present our calculations for n = 100 cm -3 , T = 100 K, and 
X = 10 with different H I column densities (10 21 , 10 19 , 10 20 , and 
10 22 cm -2 , respectively). The thin solid line represents the cal- 
culation with the same parameters as the thick solid line except 
for a high formation rate of i?dust (small grains with a = 0.03 fim 
is assumed and i?dust becomes 3.3 times larger). 



et al. (2003a). Ledoux et al. (2003) have shown a correla- 
tion between log n and log /h 2 , which strongly suggests that 
H2 forms on dust grains. Stringent upper limits are laid on 
DLAs with logK < —1.5. It should also be noted that for 
DLAs with log k > —1.5 there is a large dispersion in log /h 2 • 
This dispersion implies that the molecular abundance is not 
determined solely by the dust-to-gas ratio. Therefore, it is 
necessary to model the statistical dispersion of log /h 2 • 

In Figure^ we plot the relation between molecular frac- 
tion and dust-to-gas ratio of Ledoux et al. (2003) (see their 
paper for details and error bars). We also plot the data of a 
Lya absorber in which H2 is detected (the absorption red- 
shift, z abs = 1.15) toward the QSO HE 0515-4414 (Reimers 
et al. 2003). For this absorber, we adopt log iV(H 1) = 19.88, 
log/ H2 = -2.64, and logK = -0.051 (Reimers et al. 2003). 
The lines in those figures are our model predictions, which 
are explained in the following section. 

We also use the H I column density. In particular, 
Ledoux et al. (2003) show that there is no evidence of corre- 
lation between H I column density and molecular fraction. 

3.2 Individual H2-detected objects 

In order to derive the reasonable range of physical quanti- 
ties characterising Lya absorbers, we use the H2 data. The 



objects whose H2 absorption lines are detected allow us to 
constrain the physical state of gas. As mentioned in Section 
121 our analysis is limited to H2. The excitation tempera- 
ture is related to the ratio of the column densities as (e.g. 
Levshakov et al. 2000): 



N(J) 
N(0) 



g± 
90 



exp 



B V J(J+1) 



Tj 



(16) 



where N(J') is the column density of H2 in the level J = J', 
the statistical weight of a level J, g.j, is 3(2J + 1) for odd 
J and (2 J + 1) for even J, and the constant B v — 85.36 K 
is applicable to the vibrational ground state. If the column 
densities of H2 in the rotational J-th and ground states are 
known, we can determine the excitation temperature Tjo 
by solving the above equation. In particular, T\o is the best 
tracer for the kinetic temperature (Jura 1975). Therefore, 
we approximate T ~ Tio in the discussion of this section. 

Using the column densities of J = 4 or 5 level, we can 
constrain the gas density following the simple procedure in- 
troduced by Jura (1975). Those levels are populated by di- 
rect formation into these levels of newly created molecules, 
and by pumping from J = and J = 1. The assumption that 
the lowest two levels ( J = and 1) are dominated in pop- 
ulation holds also to the H2 detected DLAs. Following Jura 
(1975) we adopt /3(0) ~ (3(1) (see also Jura 1974), where 
P( J) is the total rate of an upward radiative transition from 
level J by absorbing Lyman- and Werner-band photons. 
This assumption is true as long as the saturation levels of ab- 
sorption are the same. The distribution function of the levels 
of formed H2 is taken from Spitzer & Zweibel (1974), who 
treat a typical interstellar condition. We use the ratio of two 
column densities at the levels J — 1 and J = 0, -/V(1)/7V(0), 
and assume that fi t o = «-h 2 (l)/ n H 2 (0) = N(1)/N(0), where 
nH 2 (J') is the hydrogen number density in the level J = J' . 
Then we can rewrite equations (3a) and (3b) of Jura (1975) 
in the following forms: 



A{4 



nn 2 (4) 

— — tidustn 



Jin 



A(n o\ n H 2 (5) 

,4(5 3) — = Rdustn 



0.19 + 



0.44 



9.1 



1 + hfi 

9.1/i,o 
1 + /i,o 



P4,0 



P5,l 



(17) 
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where p^o and ps,! are the pumping efficiencies into the 
J — 4 and J = 5 levels from the J = and J = 1 
levels, respectively, and A(J' — * J") denotes the sponta- 
neous transition probability from J = J' to J = J". The 
H2 formation rate coefficient 7?dust, the gas particle number 
density n, and the number density of H I nn are defined 
in Section O We adopt A(4 2) = 2.76 x 10" 9 s" 1 , 
A(5 -» 3) = 9.85 x 10" 9 s" 1 (Spitzer 1978), p 4 ,o = 0.26, and 
p 5 ,i = 0.12 (Jura 1975). 

Assuming nH 2 (4)/riH = N(4)/Nn and riH 2 (5)/riH = 
N(5)/Nn, we estimate the gas number density n from equa- 
tions 1171 and/or 1181 . -Rdust is given by equation Q, where 
we adopt the observational dust-to-gas ratio for k and the 
excitation temperature Tio for T. Here we assume that 
Tdust = 12 K, because we do not know the ISRF at this 
stage. However, this assumption does not affect the result 
in the range of Td us t consistent with the range of x estimated 
below. 

Finally equation 1101 is used to obtain the UV radia- 
tion field x (proportional to Rdi SB \ equation [5] where Shield 
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depends on Nn 2 ) by using the estimated T = Tio and n, 
and the observed value of nn 2 /nu = Nh 2 /Nh- Each H2- 
detected object is discussed separately in the following. This 
simple analytical method suffices for the statistical character 
of our study. Careful treatments focusing on individual ob- 
jects may require a more detailed treatment of H2 excitation 
state as described, for example, in Browning, Tumlinson, & 
Shull (2003). 

3.2.1 Q 0013-004 (z abs = 1.968; 

The excitation temperature is estimated to be Tio — 73 
K (Petitjean, Srianand, & Ledoux 2002). Only the up- 
per limit is obtained for Nu (logA^H [cm -2 ] ^ 19.43). 
Thus, N(4)/N H > 3.2 x 10" 5 and N(5)/N H > 1.3 x 10~ 5 . 
Those two values indicate -Rdustn ^ 6.0 x 10~ 14 s" 1 and 
Rdustn 1.4 x 10~ 13 s _1 , respectively. This formation rate 
is very high compared with the Galactic molecular clouds 
(e.g. Jura 1975), and if the Galactic dust-to-gas ratio is as- 
sumed, we obtain n > 6 x 10 3 cm -3 . However, with this high 
density, the levels J = 2 should be in thermal equilibrium, 
but T20 = 302 K, which is higher than Tio. This indicates 
that the gas density should be less than the critical den- 
sity, n < 200 cm -3 , which is consistent with the density 
derived from C I lines (10-85 cm -3 ; Petitjean et al. 2002). 
The discrepancy between a density derived from H2 and one 
from C 1 is also reported for the absorption system of HE 
0515-4414 (Reimers et al. 2003; Section I3T51 . They sus- 
pect that the H2 formation rate is larger than the Galactic 
value. For example, if the grain size is typically smaller than 
that assumed in equation fT). the formation rate becomes 
larger. 

Since only the lower limits are obtained for Nn 2 /Nn 
and Rdustn, it is not possible to determine X- By using 
logiVH 2 = 16.77, we obtain Sshieid = 8.4 x 10~ 3 . It may 
be reasonable to assume that Nh/Nh 2 ^ 1, an d in this case, 
we obtain i? d iss = Rdustn(N H / Nh 2 ) > 6.0 x 10" 14 s _1 . With 
this i?diss and the above Sshidd, we obtain \ ^ 0.16, sup- 
porting the existence of radiation field whose intensity is 
roughly comparable to or larger than the Galactic one. 

3.2.2 Q 0013-004 (z abs = 1.973; 

The H2 abundance is only poorly constrained, and it is im- 
possible to determine the excitation temperature. We as- 
sume the same excitation temperature as the previous object 
2 abs = 1.968 (Tio = 73 K). The excitation state N(4)/N u = 
3.2 xlO" 5 is interpreted to be R dust n = 4.1 xlO" 15 s" 1 . This 
object has k = 0.099, and thus, .Rdust = 2.2 x 10" 18 cm 3 s" 1 . 
Therefore, we again obtain a high density n ~ 2000 cm -3 . 
For this object, there is a large observationally permitted 
range of Nn 2 (S'shidd = 3.0 x 10~ 5 -1.5 x 10~ 3 ), and corre- 
spondingly the range of x — 23-80 is derived. 

3.2.3 Q 0347- 383 fz abs = 3.025; 

The J = 1 level is highly populated relative to J = 0, which 
suggests that the kinetic temperature of this system is very 
high (Tio > 880 K; Ledoux et al. 2003). Levshakov et al. 
(2002) show that the excitation temperature of 825 K is ap- 
plicable to the J = 0-5 levels. However, they also show that 



the width of the H2 lines indicate that the kinetic temper- 
ature is less than 430 K. The excitation state N(4)/Nn = 
3.6 x 10~ 8 indicate that Rdustn = 2 x 10" 16 s _1 , consistent 
with Levshakov et al. (2002). With k = 0.0857, we obtain 
-Rdust = 1.8 x 10~ 18 cm" 3 s" 1 and 1.1 x 10~ 18 cm -3 s" 1 
for T = 400 K and 800 K, respectively. Thus, we obtain 
n = 100-200 cm -3 . Levshakov et al. (2002) derive a gas 
density much lower (6 cm -3 ) than these, but they assume 
much higher -Rdust comparable to the Galactic value. How- 
ever, such a large i?dust is difficult to achieve, since the dust- 
to-gas ratio is much smaller. The UV radiation field derived 
from Rdustn and S'shidd = 0.38 is x — 15. 

3.2.4 Q 0405-443 fz abs = 2.595; 

The excitation temperature is Tio = 100 K (Ledoux et al. 
2003). The upper limit for AT(4) is obtained {N(A)/Nn < 

9.3 x 10 -8 ). This upper limit is interpreted to be Rdustn < 

2.4 x 10" 16 s"\ and using R dnst = 1.2 x 10" 18 cm 3 s _1 
derived from k — 0.049, we obtain n < 200 cm -3 . The upper 
limit of the radiation field is estimated form the upper limit 
of -Rdustn as x < 4.1 (with S'shidd = 7.3 x 10~ 4 ). 

3.2.5 Q 0528-250 fz abs = 2.81i; 

There is no information on the excitation state of H2. 

3.2.6 Q 0551-366 fz abs = 1.962; 

The component z a bs = 1.96214 dominates the H2 content in 
this system (Ledoux et al. 2002). For this component, the 
excitation temperature is Tio — 110 K. From N(4:)/Nn = 
7.1 x 10~ 7 , we obtain Rdustn = 2.0 x 10" 15 s" 1 , and with 
-Rdust = 1.6 x 10 -17 (k = 0.63), we obtain n = 130 cm" 3 . 
The shielding factor is Sshidd = 2.3 x 10~ 3 , which indicates 
that the UV radiation field is estimated to be x — 24. The C 
1 fine structure levels indicate n ~ 55-390 cm -3 , consistent 
with our estimate. 

3.2.7 Q 1232+082 (z abB = 2.338) 

This object is observed by Srianand, Petitjean, & Ledoux 
(2000). The excitation temperature is Tio = 185 K. Var- 
shalovich et al. (2001) observe HD lines and estimate the 
excitation temperature from the ratio of J = 1 and J — 
populations as T = 70 ± 7 K. This is lower than above, but 
the cold gas phase is supported. 

The fraction of the J = 4 population is N(4)/Nh = 
5.6x 10" 7 , which leads to Rdustn = 2.2xl0" 15 s" 1 . By using 
k = 0.043, we obtain R dust = 1.1 x 10" 18 cm 3 s" 1 . Then, 
the density is estimated to be n — 2000 cm~ 3 . We can also 
use N(5)/N H = 5.8 x 10~ 7 to derive n = 4100 cm" 3 . Those 
densities are large enough to thermalise the level J = 2, 
but this level is not thermalised. Therefore, Srianand et al. 
(2000) argue that n should be much smaller. They derive the 
gas density 20 < nn < 35 cm -3 from the C I observation. 
The absorption systems of Q 0013-004 and HE 0515-4414 
also have the same problem that the density derived from H2 
excitation is too high. The H2 formation rate -Rdust may be 
larger than that estimated here probably because of a small 
grain size. Ledoux et al. (2003) argue that the molecular 
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fraction log /h 2 = —3.41 (log Ah 2 = 17.19) should be taken 
as a lower limit. Therefore, Sghield ^ 3.9 x 10~ 3 , and an 
upper limit for the UV radiation field is estimated to be 



3.2.8 Q 1444+014 (z.bs = 2.087; 

The excitation measurements indicate that Tio = 190 K 
(Ledoux et al. 2003). From the observational upper limit of 
the J = 4 population, A(4)/A H < 1.4 x 10" 6 , we obtain 
Rdustn ^ 5.4 x 10" 15 s" 1 by using k = 0.225. Then we 
obtain n < 950 cm -3 . With .Sshieid = 5.8 x 10" 4 , the upper 
limit of -Rdust gives the upper limit of the ISRF of Rdustn is 
12. 



3.2.9 HE 0515-4414 (ze. bs = 1.15; 

This object has been observed by Reimers et al. (2003). 
The excitation temperature Tio = 90 K and the excitation 
state A(4)/Ah = 1.3 x 10~ 5 are obtained. Then, we obtain 
Rdustn — 3.1 x 10" 14 s _1 . By using k = 0.89, we estimate 
that -Rdust = 2.1 x 10" 17 cm 3 s" 1 , and the density is thus 
estimated to be 1500 cm -3 . Again we obtain a very high 
density, but the J = 2 level is not thermalised (Reimers 
et al. 2003). Therefore, the density should be much lower. 
Reimers et al. (2003) also report the same problem, adopt- 
ing the density derived from C I (n ~ 100 cm , Quast, 
Baade, & Reimers 2002). Perhaps a large H2 formation rate 
i?dust is required as in Q 0013—004 (z a bs = 1.968) and Q 
1232+082 (z abs = 2.338). With S shle u = 5.9 x 10 -3 , the 
ISRF estimated from .Rdustn is x — 100. 



3.3 Typical physical state 

Although the H2 detected objects have a wide range of phys- 
ical quantities, they are roughly consistent with a cold phase 
whose typical density and temperature are n ~ 100 cm" 3 , 
T ~ 100 K, respectively. The ISRF is generally larger than 
the Galactic value, and the intensity is roughly summarised 
as an order-of-magnitude difference, \ ~ 10- Some objects 
may indicate much higher densities in ~ 1000 cm -3 ) than 
those derived from C I excitation states. The discrepancy 
may be due to the high H2 formation rate on dust grains, 
suggesting that -Rdust may be larger than that assumed in 
this paper. In view of equation Q, -Rdust becomes larger if 
the typical size of grains is smaller. 

Our previous work (Hirashita et al. 2003) suggests that 
the covering fraction of the region with a density larger than 
1000 cm is low (< 1%) because of small sizes of such dense 
regions. Indeed if we assume the density of 1000 cm" 3 and 
the column density of 10 21 cm -2 , we obtain the size of 0.3 
pc. Because of small geometrical cross sections, such dense 
regions rarely exist in the line of sight of QSOs. Therefore, 
for the statistical purpose, we do not go into details of very 
dense (> 1000 cm" 3 ) regions. 

The extremely high density might be an artefact of our 
one-zone treatment. If the destruction of H2 occurs selec- 
tively on the surface of the clouds and the formation of 
H2 occurs in the interior of the clouds, our approach will 
not work. However, the formation and destruction should 



be balanced globally, and our approach could give the first 
approximation of such a global equilibrium. 

We should note that those quantities are derived only 
from the H2 detected objects. Those could be associated 
with the star-forming regions, since stars form in molecular 
clouds. Therefore, the physical quantities derived from H2- 
detected objects could be biased to high radiation fields and 
high gas density. In fact, diffuse warm neutral gas (T ~ 8000 
K and n ~ 1 cm -3 ; McKee & Ostriker 1977) occupies a 
significant volume in the interstellar spaces of galaxies and 
can be detected more easily than cold gas (Hirashita et al. 
2003). Therefore, we also examine more diffuse gas whose 
typical density is much smaller than 100 cm -3 in Section 
14.41 Indeed, the spin temperature of H I in Chengalur & 
Kanekar (2000) is high (T > 1000 K) for a large part of their 
sample, although the relation between the spin temperature 
and the kinetic temperature should be carefully discussed. 



4 DUST AND H 2 

4.1 Various column densities 

We examine the relation between dust-to-gas ratio and H2 
abundance of the sample Lya systems. The relation can be 
predicted by solving equation I1UI . First of all, we should 
examine if the relation is compatible with the reasonable 
physical state of gas. From the H2 detected objects, we have 
derived n ~ 100 cm" 3 , T ~ 100 K, and \ ~ 10- In Figure □ 
we show the relation between /h 2 and k calculated by our 
model with various H I column densities (n = 100 cm" 3 , 
T = 100 K, and \ = 10 are assumed). The thick solid, dot- 
ted, dashed, and dot-dashed lines represent the results with 
Ah = 10 21 , 10 19 , 10 20 , and 10 22 cm -2 , respectively. The 
thin solid line represent the result with a high H2 formation 
rate on grains as suggested for some objects f Section 13. 21 . 
where a — 0.03 um is assumed to increase -Rdust by 3.3 times 
in order to see the effect of increased H2 formation rate. 

The relation between molecular fraction and dust-to- 
gas ratio is well reproduced. The rapid enhancement of 
molecules for /h 2 > /h 2 (equation |SJ is caused by the self- 
shielding effect. If Ah is large, the self-shielding condition is 
achieved with a small value of /h 2 . Therefore, the molecular 
fraction tends to become larger in systems with larger Ah. 

The molecular fraction is very sensitive to the H I col- 
umn density and the H2 formation rate on dust grains in 
the self-shielding regime. Because of such a sensitive depen- 
dence, the large dispersion of /h 2 among the H2 detected 
objects can be reproduced by the four lines. 



4.2 Density and temperature 

Here, we examine the dependence on gas density, tempera- 
ture, and ISRF. In the previous subsection, we have shown 
that /h 2 depends on Ah. Therefore, we only concentrate on 
the objects with 20.5 < log Ah < 21.5. This range is typical 
of DLAs. In Figure^ we show the relation between /h 2 and 
k for various (a) gas density, (b) gas temperature, and (c) 
intensity of ISRF. 

In Figure [5^, we investigate the three densities n = 
100 cm -3 , 30 cm -3 , and 300 cm -3 , in order to test if the 
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Figure 2. Same relations as Figure HI Only the data with 20.5 < logTVir < 21.5 are shown (squares), (a) The solid, dotted, and dashed 
lines represent our calculations for n = 100 cm -3 , n = 30 cm -3 , and n = 300 cm -3 , respectively. The other quantities are fixed: 
T = 100 K, x = 10, and N H = 10 21 cm" 2 . The DLA toward Q 0405-443 (2 abs = 2.595), marked with 'Q0405', has an ISRF (x < 4.1) 
much smaller than assumed here, (b) The solid, dotted, and dashed lines correspond to our results for T = 100 K, 30 K, and 1000 K, 
respectively, with n = 100 cm -3 , \ = 10, and Nn = 10 21 cm -3 , (c) The solid, dotted, and dashed lines represent our results for \ = 10, 
3, and 30, respectively, for n = 100 cm -3 , T = 100 K, and Nh = 10 21 cm -2 , respectively. 



observational data points are reproduced with an order-of- 
magnitude density range centered at the typical density de- 
rived for H2 detected objects (n ~ 100 cm -3 ). We assume 
typical quantities: T = 100 K, \ = 10, and log Ah = 21, and 
we adopt a = 0.1 /im and S = 2 g cm (unless otherwise 
stated, we adopt those a and 8 throughout this paper). The 
data are well reproduced except for a point marked with 
'Q0405'. This represents the DLA at z a b s = 2.595 toward 
the quasar Q 0405-443 (log Nn = 20.90). However, for this 
object, the ISRF is estimated to be x < 4.1 (Section 13. 21 . 



smaller than assumed here (x = 10). This low radiation field 
is a possible reasons for the molecular abundance larger than 
that predicted by the models, although there could be other 
possibilities (e.g., large -Rdust). 

Figure shows the dependence on gas temperature. 
As discussed in Section f2. II the temperature dependence of 
the H2 formation rate -Rdust is uncertain. For example, if we 
take into account the recombination efficiency in Cazaux & 
Tielens (2002), the H2 formation rate is much reduced for 
T > 100 K, and /h 2 becomes more than 4 times smaller than 
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presented in this paper. Therefore, Figure^ is shown only 
to examine the conventional reaction rate often assumed in 
other literatures. The DLA toward Q 0347—383 at z a b s = 
3.025 has the highest T 10 ~ 800 K of all the H 2 detected 
DLAs, and we examine the temperature range from 30 K 
up to 1000 K. The solid, dotted, and dashed lines in Figure 
correspond to T = 100 K, 30 K, and 1000 K, respectively. 

The temperature may systematically change as a func- 
tion of dust-to-gas ratio («), because the photoelectric heat- 
ing of dust dominates the gas heating (Wolfire et al. 1995). 
However, at least for the H2 detected objects, there is no ev- 
idence that the gas temperature correlates with the dust-to- 
gas ratio. In this paper, we do not include such a correlation 
in our analysis. 

Figure^ shows the dependence on the ISRF intensity. 
We examine x = 3, 10, and 30, in order to examine an or- 
der of magnitude centered at the typical value derived for 
H2 detected objects. We see that the range well reproduces 
the observed data points. The increase of n has almost the 
same influence on the decrease of x, i- e -, the result become 
the same if we assume the same Rdustn/x- Indeed, the ratio 
between the H2 formation and destruction rates is propor- 
tional to Rdustn/x- 

The above results generally show that /h 2 is sensitive to 
the variation of physical quantities particularly in the self- 
shielding regime. The sensitive dependence of /h 2 in the self- 
shielding regime causes a large dispersion of /h 2 , and almost 
all the data points with a large scatter are explained by the 
density and temperature range considered above (see also 
the discussion in Section l2.3ft . The scatter typically arises 
for log k > — 1.5. 

As a summary of this section, we present the likelihood 
contours on the /h 2 — ft diagram under the condition that 
n, T, and x vary in the above range: 1.5 ^ logn [cm -3 ] 
2.5, 1.5 < logT [K] ^ 3, and 0.5 «S logx «S 1-5. Here 
the likelihood is defined as the number of combinations of 
(logn, logT, logx) that satisfies a certain (log/n 2 , logft). 
We follow the formulation described in Appendix [E] where 
we put x = (log/ H2 , logft), y = (logn, logT, logx), N = 
64, and M = 64 with the range of (logn, logT, logx) de- 
scribed above ([1.5, 2.5], [1.5, 3], and [0.5, 1.5], respectively) 
(the result is independent on N and M if we adopt num- 
bers larger than ~ 30). As before, the observational sample 
is limited to the DLAs with 20.5 < log Ah [cm -2 ] < 21.5 
are examined, and we assume Ah = 10 21 cm -2 in the the- 
oretical calculation. For a more detailed analysis, the prob- 
ability distribution functions of (logn, logT, logx) should 
be considered. Since the probability distribution functions 
are unknown for those quantities, we only count the number 
of solutions. The ranges constrained here could be regarded 
as the typical dispersions (a). The possible physical correla- 
tion between those three quantities is also neglected in our 
analysis. We leave the modeling of the physical relation of 
those quantities for a future work (see Wolfire et al. 1995 for 
a possible way of modeling). 

In Figure C3 we show the contour of the likelihood P 
( Appendix [BJ. The levels show likelihood contours of 50%, 
70%, 90%, and 95% according to our model (see Appendix 
fTH. All the data points are explained by the assumed ranges 
of the quantities. The wide range of /h 2 covered in the self- 
shielding regime (log /h 2 > —6.7) explains the observational 
large scatter of /h 2 - 




log K 



Figure 3. Likelihood contour on the log/n 2 — logK diagram, 
when the density, temperature, and radiation field are varied in 
the range 1.5 < logn [cm" 3 ] ^ 2.5, 1.5 < logT [K] ^ 3, and 
0.5 $C logx ^ 1-5) respectively. We adopt TVh = 10 21 cm -2 . 
The contours are drawn for four levels: 95%, 90%, 70%, and 50% 
of the sample is supposed to be in the regions corresponding to 
each level. The observational data points are the same as those 
in Figure |21 

4.3 Column density 

Another important conclusion derived by Ledoux et al. 
(2003) is that the H I column density and the molecular frac- 
tion do not correlate. Therefore, we also present our model 
calculation for the Ah — /h 2 relation. Since the difference in 
the dust-to-gas ratio reproduces a very different result, we 
only use the sample with 0.01 < k < 0.1. Here we assume 
k = 0.03. In Figure [I] we show our results with various gas 
density (the thick solid, dotted, and dashed lines represent 
the results with n — 100, 30, and 300 cm -3 , respectively), 
where we assume that T = 100 K, k — 0.03, and x = 10. 
The thick lines underproduces the observed H2 fraction of 
the H2 detected objects, since the dust-to-gas ratio of those 
objects are systematically higher than that assumed. In par- 
ticular, the data point with the bar (z a b s = 1.973 toward Q 
0013-004) has the dust-to-gas ratio of K = 0.099. This data 
point can be reproduced by the thin dashed line produced 
with the same condition as the dashed line but with k = 0.1. 
As seen in Section [4.21 the same n/x reproduces the same 
result with the other quantities fixed. Thus, we do not show 
the result for the various ISRF intensity X- 

The thin dotted line in Figure 2] represents the relation 
/h 2 — /h 2 (equation Therefore, if a data point is above 
the thin dotted line, the H2 is self-shielding the dissociat- 
ing photons. We observe that the molecular fraction is very 
sensitive to the gas density and ISRF intensity, especially 
in the self-shielding regime. This sensitive dependence tends 
to erase the correlation between /h 2 and Ah in the observa- 
tional sample and could explain the absence of correlation 
in the observational sample. 
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Figure 4. Relation between the molecular fraction and the H I column density. Only the observational sample with 0.01 < re < 0.1 is 
selected (squares). The upper and lower arrows represents the observational lower and upper limits for the molecular fraction, respectively. 
The thick solid, dotted, and dashed lines present our calculations with various gas density (n = 100, 30, and 300, respectively). The gas 
temperature, dust-to-gas ratio and ISRF intensity are assumed to be T = 100 K, re = 0.03, and \ = 10> respectively. The thin dashed 
line indicate the same calculation with the thick dashed line, but a higher dust-to-gas ratio of re = 0.1 is assumed. We also show the 
self-shielding condition by dotted line, above which the H2 self-shielding effect becomes significant. 



4.4 Possibility of warm phase 

As mentioned in Section [3.31 our derived quantities are bi- 
ased to the H2 detected objects. As shown in Hirashita et al. 
(2003), Eb-rich regions are only confined in a small dense re- 
gions, whose gas temperature is T < 100 K. However, Chen- 
galur & Kanekar (2000) observationally derive the spin tem- 
perature T > 1000 K for a large part of their DLA sample 
and T ~ 100 K for a few DLAs, although the large beam size 
relative to the size of the QSO may tend to overestimate the 
spin temperature. Ledoux et al. (2002) find that the DLAs 
with H2 detection are always dense (n > 30 cm -3 ) and cool 
(T < 100 K). Therefore, most of the DLAs may be warm 
except for the H2 detected ones. 

Based on a simulation suitable for DLAs, Hirashita et 
al. (2003) have shown that most (~ 90%) of the regions are 
covered by warm and diffuse regions with T ~ 1000-10000 
K and n ~ 1 cm -3 . In the warm phase, H2 formation on 
dust is not efficient and H2 formation occurs in gas phase 
(Liszt 2002; Cazaux & Tielens 2002). Therefore, we cannot 
put any constraint on the physical state of warm gas in the 
framework of this paper. The H2 formation in gas phase 
occurs in the following route: H + e~ — > H~ + hv, H~ + H — > 
H 2 +H+, and H++H -> H+, H++H -» H 2 +H+. Liszt (2002) 
shows that the gas phase reactions result in a molecular 
fraction /h 2 ~ 10 -7 -10 -8 . This range is consistent with the 
data with upper limits of /h 2 • 



4.5 Lack of very H2-rich DLAs? 

The likelihood contours presented in Figure suggests that 
some DLAs should be rich in H2 (log/H 2 > —1) if the dust- 
to-gas ratio is around the Galactic value (logft > —0.5). 
However, all the objects in the sample have molecular frac- 
tion log/H 2 < —1- There are three possible explanations 
that we discuss in the following. 

The first possibility of the lack of very molecule-rich 
DLAs is the contamination from the molecule-poor inter- 
cloud medium. If the contribution of intercloud medium to 
the column density is high, the molecular fraction is inferred 
to be low even if a molecule-rich region is present along the 
line of sight. 

Secondly, a QSO selection effect might occur. If the 
dust-to-gas ratio is as high as logft > —0.5 and the col- 
umn density is Ah = 10 21 cm -2 , the optical depth of dust 
in UV is larger than 0.3 (equation 0. Therefore, QSO is 
effectively extinguished by dust if there is a dust-rich cloud 
in the line of sight. Such a population is also suggested by a 
numerical simulation (Cen et al. 2003). Observationally, it is 
a matter of debate whether the dust bias is large or not. El- 
lison et al. (2002) study optical colours of optically-selected 
QSO samples, and find that the effect of dust extinction 
of intervening absorbers is small. Fall et al. (1989) show a 
significant dust extinction of intervening DLAs by showing 
that QSOs with foreground DLAs tend to be redder than 
those without foreground DLAs. 
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The third possibility is concerned with the probability 
of detecting molecule-rich DLAs. Hirashita et al. (2003) have 
shown that the covering fraction of Irb-rich clouds in a galac- 
tic surface is < 10%. Therefore, the probability that the line 
of sight passes through such clouds may be very low. Some 
very small molecule-rich clumps, which would be difficult to 
find as QSO absorption systems, are also found (e.g. Richter 
et al. 2003; Heithausen 2004). The probability distribution 
function of gas density and temperature should be treated 
by taking into account the covering fraction. The detailed 
treatment of such probability is left for the future work. 

Extremely molecular-rich objects with /h 2 ~ 1 might 
escape H I absorption detection (Schaye 2001). A new strat- 
egy will be required to detect such fully molecular clouds 
at high z. An observational strategy for high-z molecular 
clouds with high column densities is discussed in Shibai et 
al. (2001). 



4.6 Summary of our analysis 

In order to summarise our analysis and for the observers' 
convenience, we present Figure Various physical states of 
gas could be discriminated on the /h 2 — « (molecular frac- 
tion vs. dust-to-gas ratio) diagram. First of all, we should 
stress that this diagram is only useful to obtain the first re- 
sult about the gas state of an absorption line system whose 
H2 fraction and dust-to-gas ratio are constrained, or to ob- 
tain the statistical properties of gas phase of a sample of 
absorption systems. For the confirmation of gas state, more 
detailed analysis such as treatment of C I fine structure lines 
should be combined. 

In Figure|S] the shaded area shows the region where 90% 
of the gas with 1.5 < logn [cm -3 ] < 2.5, 1.5 < logT [K] < 3, 
and 0.5 ^ logx ^ 1-5 is predicted to be located (see Fig- 
ure |HJ|. Those ranges of the quantities are typical of H2- 
detected objects and representative of "cold" gas. The strip 
indicated by "warm" shows the warm phase in which H2 
predominantly forms in the gas phase (we take the values 
from Liszt 2002). There is an overlapping region of the cold 
and warm states, and if a data point lies in this region, the 
warm and cold states are equally probable. There remain 
two regions not included neither in "cold" nor in "warm". 
The upper region shows an enhancement of the molecular 
fraction, which requires a high H2 formation rate and/or a 
low H2 destruction rate. This condition is typically charac- 
terised with n/x^Z 100 if l ne S as temperature is favourable 
for the H2 formation on dust grains (< 100 K). On the 
other hand, the lower region marked with "n/x < 1" indi- 
cates that the H2 fraction is larger than the typical value 
in warm phase but lower than our likelihood range for cold 
gas. If a data point lies in this region, there could be the 
following three possibilities: (i) n/x iS 1 and T < 100 K, so 
that the H2 formation rate is suppressed because of a low 
density and/or the H2 destruction rate is enhanced because 
of a high ISRF; (ii) 300 < T < 500 K, so that the H 2 for- 
mation on dust occurs only with a small rate; (iii) the cold 
and warm phases coexist in the line of sight. We can use 
this diagram as the "first quick look" for the physical state 
of gas if H2 is detected in a system whose dust-to-gas ratio 
has been estimated. 




-3 -2 -1 

log K 



Figure 5. Summary of our analysis. If we obtain the molecular 
fraction and dust-to-gas ratio for an absorption system, we can 
roughly estimate the physical state of gas. This diagram should 
only be used to obtain the first rough estimate. The region marked 
with "cold" indicates the same area as the 90% level of Figure 131 
(representative of a typical cold phase). The strip marked with 
"warm" indicates the typical range of /h 2 in a warm phase where 
H2 is formed in gas phase (Liszt 2002). The other two regions 
are possibly characterised by the physical state, n/x ^ 100 and 
n/x ;$ 1 (see text for details). 



5 STAR FORMATION 
5.1 Star formation rate 

The above results for the relation between molecular abun- 
dance and dust-to-gas ratio for DLAs strongly suggest that 
there are internal UV radiation sources originating from star 
formation activity (see also Ge & Bechtold 1997; Ledoux et 
al. 2002; Petitjean et al. 2002). Indeed, the cosmic UV back- 
ground radiation intensity is typically J21 ~ 1 around z ~ 3. 
This corresponds to x - 3.1 x 10~ 2 . The ISRF intensity is 
clearly larger than this, and the local heating sources such 
as stars dominates the ISRF. Assuming that the ISRF is 
produced by stars, we relate the ISRF with the SFR. 

Hirashita, Buat, & Inoue (2003) have derived the rela- 
tion between UV luminosity and SFR as 

SFR = C2ooo^2ooo , (19) 

where L2000 is the monochromatic luminosity at 2000 A and 
C2000 = 2.03 x 10~ 40 [(M Q yr" 1 )/(erg s" 1 A" 1 )] (under a 
Salpeter initial mass function with the stellar mass range 
of 0.1-100 Mq and a constant star formation rate with the 
duration of 10 s yr; see also Iglesias-Paramo et al. 2004). 
The surface luminosity density, defined as the luminosity per 
surface area, can be roughly equated with the ISRF inten- 
sity (Appendix[XJ. Therefore, the 2000 A surface monochro- 
matic luminosity density, E2000, is estimated as 

E2000 — «t2ooo = 9.0 x 10 _7 x erg cm -2 s -1 A 1 . (20) 
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where we use the 2200 A energy density in Habing (1968) for 
the normalisation of U2000 (2000 A monochromatic radiative 
energy density). Using equation 1)20^ . we obtain the surface 
density of SFR, Esfr: 

Esfr = C2000S2000 = 1.7 x 10 _3 x M Q yr" 1 kpc" 2 . (21) 

Indeed, this roughly gives the Galactic surface SFR den- 
sity (~ 10" 3 Mq yr" 1 kpc" 2 ; Burkert, Truran, & Hensler 
1992) if we assume a typical ISRF intensity of the solar 
vicinity {\ = 1). But some observational data indicate 
higher Galactic surface SFR in the solar vicinity such as 
- 5 x 10" 3 Mq yr -1 kpc -2 (Smith et al. 1978). The gen- 
eral ISRF in the Galaxy could be systematically higher (i.e. 
X > 1; Shibai et al. 1999). 

Wolfe et al. (2003) have derived the calibration Esfr = 
2.5 x 10 _3 x Mq yr -1 kpc -2 . The difference comes from the 
different assumption on the IMF and the different stellar 
mass-luminosity relation. However the difference in only 0.17 
in the logarithmic scale. 

The probable range of the radiation field constrained 
in section T4. 21 is 0.5 < logx J; 1-5. This range predicts the 
surface SFR density, 5 x 10" 3 Mq yr -1 kpc -2 < E SF r < 
5 x 10~ 2 Mq yr" 1 kpc" 2 . Wolfe et al. (2003a) derive the 
UV radiation field from C n* absorption line. Their anal- 
ysis is dependent on the assumed phase (cool or warm) 
of the ISM. The calculated SFR differs by an order of 
magnitude between the cool and warm media. Based on 
their results, Wolfe et al. (2003b) have suggested that the 
probable range of the SFRs of their sample is 10~ 3 < 
Esfr [Mq yr -1 kpc -2 ] < 10~ 2 . This range is roughly con- 
sistent with our range considering the uncertainty in the as- 
sumed physical state of gas. A numerical work by Nagamine, 
Springel, & Hernquist (2004) explains the SFR theoretically. 
Our estimate provides an independent observational calcu- 
lation for the SFR of DLAs. 

Assuming a typical radius of R = 3 kpc for DLAs 
(Kulkarni et al. 2000), we obtain the SFR Esfrtt-R 2 ~ 0.1- 
1 Mq yr" 1 . This range is broadly consistent with the up- 
per limits obtained by some imaging observations of DLAs 
(Bunker et al. 1999; Kulkarni et al. 2000; Bouche et al. 2001). 

5.2 Comparison with other SFR estimates 

Wolfe et al. (2003a) have estimated the SFR of a sample 
of DLAs by using C n* absorption intensity. They con- 
sider the balance between the cooling rate dominated by 
[C 11] fine structure line emission and the heating rate domi- 
nated by photoelectric heating of a UV radiation field. Since 
our method for SFR estimate is independent of theirs, the 
consistency between the two methods is interesting to ex- 
plore. Wolfe et al. (2003a) investigate two thermally stable 
states, WNM (warm neutral medium) and CNM (cold neu- 
tral medium), adopting the scheme of Wolfire et al. (1995). 
There are three overlapping samples between Wolfe et al. 
(2003a) and Ledoux et al. (2003) as shown below. 

5.2.1 Q 034 7- 383 (z abs = 3.025,) 

Some H2 lines are detected in this object, and we have 
derived x = 15 (Section 13. 2> . which is equivalent to 
log Esfr [Mq yr -1 kpc -2 ] = -1.6. Wolfe et al. (2003a) de- 
rive log Esfr [Mq yr -1 kpc -2 ] = —1.3 and —2.2 for the 



WNM and CNM, respectively. Our estimated temperature 
T = 400-800 K is higher than the typical value for CNM 
(~ 100 K) and lower than the WNM. In any case, our esti- 
mate is between the two values of Wolfe et al. (2003a). 3 

5.2.2 Q 1223+118 (z abs = 2.466; 

For this object, the depletion factor is extremely small 
([Fe/Zn] = -0.07 ± 0.21), and the metallicity is also small 
([Zn/H] = -1.63 ± 0.11) (Ledoux et al. 2003). Thus, it is 
expected that dust-to-gas ratio of this object is extremely 
small. Moreover, if we assume the above metal abundances, 
we obtain log n = —2.46. With this dust abundance, the 
formation of H2 through H~ could be an important process. 
The SFR cannot be estimated by our method. 

5.2.3 Ql 232+ 082 (z abs = 2 . 338; 

In Section 13.21 we have estimated the radiation field to 
be x ^ 64, which is equivalent to the surface SFR den- 
sity log E S fr [Mq yr" 1 kpc" 2 ] < -0.96. The low excita- 
tion temperature (Tio = 185 K) indicates that the gas in 
the CNM. The CNM solution of Wolfe et al. (2003a) shows 
log Esfr [Mq yr -1 kpc -2 ] = —1.9, consistent with our up- 
per limit. 

5.3 Cosmological implications 

Wolfe et al. (2003b) have extended their discussion on the 
SFR in DLAs to a cosmological context. They have found 
that the hypothesis that most of the DLAs are in WNM 
is ruled out because it would conflict with background light 
constraint. On the other hand, Chengalur & Kanekar (2000) 
have observationally shown that a large part of their DLAs 
have a temperature similar to WNM. Hirashita et al. (2003), 
by using detailed numerical simulations, have also argued 
that the probability to observe the CNM of a DLA is small 
because the tiny covering fraction of such phase. Their cal- 
culation also shows that H2 detected objects are biased to 
the CNM, and is consistent with the estimates in Section 
EH1 Most of the H 2 deficient DLAs may be in the WNM. 
Thus at the moment, there seems to be tension between the 
CNM and WNM hypotheses, which requires additional work 
in order to be relaxed. A composite analysis of fine-structure 
excitation and H2 may be required to confirm our results of 
a simple statistical approach. 

In spite of such an uncertainty, it is interesting to note 
that the star formation activity in DLAs can be investi- 
gated via our treatment of H2 formation and destruction 
rates. The surface SFR density derived by us is comparable 
to that inferred in Wolfe et al. (2003a), who have suggested 
that DLAs are an important population in the cosmic star 
formation (and metal production) history (see also Pei & 
Fall 1995; Pei, Fall, & Hauser 1999). The SFR is much lower 
than forming elliptical galaxies as calculated by Arimoto & 
Yoshii (1987), but more similar to spiral galaxies. In the con- 
text of the hierarchical structure formation, Nagamine et al. 
(2004) have explained the SFR of DLAs by a cosmological 
simulation, and they also show that the SFR of a DLA is gen- 
erally smaller than that of a typical Lyman break galaxies 
(~ 50 Mq yr" 1 ) (but see Schaye 2004). According to their 
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simulation, the host halo mass of DLAs spans over a wide 
range from ~ 10 9 M e to ~ 10 12 M Q . Another theoretical 
calculation by Hirashita & Ferrara (2002) show that high- 
z dwarf galaxies whose total mass of dark halo is around 
10 9 Mq has a SFR similar to that estimated in Section f5. II 
Our statistical work in this paper will be extended to the 
cosmological star formation history and some observational 
consequences (see also Ferrara et al. 1999). The redshift evo- 
lution of H2 abundance and the relation between metals and 
H2 (Curran et al. 2004) can also be used to constrain the 
cosmological SFR. 



6 SUMMARY 

We have modeled the H2 abundance of QSO absorption line 
systems and compared our model calculations with obser- 
vational samples of damped Lya systems (DLAs) and sub- 
DLAs. We have derived the H2 abundance, /h 2 , as a function 
of dust-to-gas ratio, k (normalized to the Galactic value) 
considering H2 self-shielding and dust extinction against dis- 
sociating photons. The /h 2 — K relation depends on the gas 
density (n) and temperature (T), and the ISRF intensity 
(X- normalised to the Galactic value). Our aim has been to 
constrain those quantities by using H2 data. Treating the 
data of H2 excitation states of the H2 detected objects, we 
adopt 1.5 < logn [cm -3 ] < 2.5, 1.5 < logT [K] < 3, and 
0.5 < logx % 1.5. From the comparison with data, we have 
found that the observational /h 2 _ « relation is naturally 
explained by the above range. The efficient photodissocia- 
tion by the ISRF can explain the extremely small H2 abun- 
dance (/h 2 % 10 -6 ) observed for k < 1/30. We have also 
succeeded in explaining the rapid increase of H2 abundance 
for k > 1/30 by the effect of self-shielding of H2 dissociating 
photons. Because of a nonlinear dependence of self-shielding 
on the physical quantities, a large scatter of H2 fraction is 
reproduced. However, we should note that the above param- 
eter range may be biased to the cold gas favourable for H2 
formation. It is still possible that most of the H2 deficient 
DLAs and sub-DLAs might be in a diffuse and warm state. 

We finally propose to estimate star formation rates 
(SFRs) of (sub-)DLAs from H 2 observations. The SFRs es- 
timated by our method are compared with those derived by 
Wolfe et al. (2003a). Two common samples give roughly con- 
sistent SFRs. The strength of UV field indicates a surface 
SFR density: 5 x 10" 3 -5 x 10" 2 M Q yr _1 kpc -2 . Therefore, 
DLAs are actually star-forming objects. The SFR is smaller 
than typical Lyman break galaxies, but DLAs may be a ma- 
jor population responsible for star formation in the high-z 
universe. 
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APPENDIX A: SURFACE LUMINOSITY 
DENSITY AND RADIATION FIELD 

In the text, we have equated the surface luminosity density 
E with the interstellar radiation field (ISRF) intensity cu. 
However, it is not obvious a priori that this assumption is 
correct. If the radiation from a stars is isotropic and the dust 
extinction is neglected, the ISRF intensity at the position r, 
ctt(r), is expressed as 



cu(r) 



I 



d r 



p(r') 



An\r' — r\ 2 



(Al) 



where p(r) is the spatial luminosity density of stars (p(r)d 3 r 
is the luminosity in the volume element d 3 r). 

One of the largest uncertainties is the spatial distribu- 
tion of radiating sources. Therefore, in the following discus- 
sions, we derive the relation between S and cu under two 
representative geometries of source distribution: spherical 
and disc-like distributions. 



Al Spherical distribution 

In the spherical symmetric distribution, we can calculate the 
ISRF intensity at the centre of the sphere by 



dR 4ttR' 



p(R) 

4tyR 2 



dRp(R) , 



(A2) 



where Ro is the radius of the whole emitting region. The 
right-hand side gives a rough estimate of the surface lumi- 
nosity density (denoted as E), and therefore, the ISRF can 
be equated with the surface luminosity density (i.e. E ~ cu). 



A2 Disc-like distribution 

In the disc-like distribution, we can calculate the ISRF in- 
tensity at the centre of the disc by 



/ dR2nR I 
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where 2h is the disc thickness, and Ro is the disc radius. 
If we assume for simplicity that p is constant, we obtain 
analytically 
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I h 
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The galactic discs usually satisfy Ro S> h, so that the above 
equation is approximated as 

Ro ' 



p*( 1 + ln£ 



(A5) 



The dependence on Ro/h is very weak, and even if we assume 
a very thin disc such as Ro/h — 100, we obtain cu ~ 5.6ph. 
The surface luminosity density can be typically estimated 
by E ~ 2ph. Therefore, cu and E have the same order of 
magnitude (different by a factor of < 3). 
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The above discussions justify our simple assumption 
cu — £ in the text. 



APPENDIX B: LIKELIHOOD FORMULATION 

We consider a set of n physical quantities, x = (xi, ■ ■ ■ , x„). 
Suppose that those quantities are determined by a set of m 
parameters, y = (j/i, ■ ■ ■ , y m ), each of which has a reason- 
able range y™ m < y; < yf ax (i = 1, ■ ■ ■ , m). The two sets 
of quantities are related to the following function /: 

V = f{x). (Bl) 

We divide x and y into N + 1 and M + 1 bins, respec- 
tively: 

xj = xf n + ^ (a^** - zf") (j = 0, • • • , N) . (B2) 
y\ EE yf« + JL(y™-yf») (fc = 0, ■ • • , M) . (B3) 

The j-th and fe-th bins of a; 4 and y t can be defined as 
the range [x^ 1 , x{\ (j = 1, • ■ ■ , N) and [y^ 1 , j/f] (fc = 
1, • • • , M), respectively. Any A*'- dimensional bin of x can be 
specified by a set of iV integers (ji, • • • , We represent 
the value of y in each bin with the median as 

^,.., M .f«t^,..,ifcLhifcV(«) 



We define iV(ji, • • • , j n ) as the number of the combination 
of m integers (fci, ■ • • , fc m ) such that f(y(ki, ■ ■ ■ , k m )) is in 
the n- dimensional bin (ji, • • • , j„) of a;. Then the likelihood 
of x(ji, ■ ■ ■ , j n ), P(ji, • • ■ , j n ), can be defined as 

■■■->)= X),-!, .... iB ^O'l ■ ■ ■ . in) " (B5) 

Let dS(p) be a contour surface of P — p ((n — 1)- 
dimensional surface), and let S(p) be the area where sur- 
rounded by dS(p). Then, the following sum Prob(S') gives 
the probability that the data x lies in S: 

Pvob(S) = J2 P ( x (ju---,jn)). (B6) 



